SIEVING AND THE ERDOS-KAC THEOREM 



Andrew Granville and K. Soundararajan 

Abstract. We give a relatively easy proof of the Erdos-Kac theorem via computing mo- 
ments. We show how this proof extends naturally in a sieve theory context, and how it leads 
to several related results in the literature. 



Let iv{n) denote the number of distinct prime factors of the natural number n. The average 
value of uj{n) as n ranges over the integers below x is 

I E = ; E E 1 = ^ E [^] = ; E (; + = logiog. + o(i). 



X 



It is natural to ask how uj{n) is distributed as one varies over the integers n < x. A. 
famous result of Hardy and Ramanujan [13] tells us that oj(n) ~ log log a; for almost all 
n < x; we say that uj{n) has normal order log log n. To avoid confusion let us state 
this precisely: given e > there exists x^ such that if x > Xe is sufficiently large, then 
(1 + e) log log X > uj(n) > (1 — e) log log x for all but at most ex integers n < x. The 
functions log log n and log log x are interchangeable here since they are very close in value 
for all but the tiny integers n < x. 

Their proof revolves around the following wonderful inequality which they established 
by induction. Define 7rfc(x) to be the number of integers n < x with t<;(n) = k. There exist 
constants cq, ci > such that for any k > we have 

X (log log X +ci)'=-^ 

1 TTfc X < Cq- — , 

logx (A; — Ij! 

for all X > 2. Hardy and Ramanujan exploited this by deducing that 

X ^ (loglogx +ci)^-i 



log X (k — 1)\ 

|fc — log log x| >e log log X j fc— log log x\>e log log x 
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which is easily shown to be about x/(logx)" where a = ctg = e2/2 + 0{e^), far less than 
ex. In fact Hardy and Ramanujan squeezed a little more out of this idea, showing that if 
K{n) ^ oo as n — > oo, no matter how slowly, then 

(2) l^^l^) ~ loglogn| < K{n) \/log logn 

for almost all integers n < x. 

Once we know that Lj{n) has normal order log log n, we can ask finer questions about 
the distribution of a;(n). For instance how is a; (n) — log log n distributed? More specifically, 
how big is this typically in absolute value? Turan [33] found a very simple proof of the 
Hardy-Ramanujan result by showing that 

(3) i ^(cc;(n) - loglogn)^ = {1 + o(l)}loglogx. 

n<x 

One deduces easily that uj{n) has normal order log log n: For, if there are me{x) integers 
< X for which |a;(n) — loglognj > eloglogx then by (3), m^{x) < + o{l))x/ loglogx, 
which is < ex for sufficiently large x. Indeed the same argument also gives (2) for almost 

all n < X. 

We have now obtained some information about the distribution of Lo{n), its average 
value, and the average difference between the value and the mean. Next we ask whether 
there is a distribution function for a;(n)? In other words if, typically, the distance between 
uj{n) and log log n is roughly of size y/log log n can we say anything about the distribution 
of 



(4) 



u>{n) — log log n ^ 
Vlog log n 



In the late 1930s Mark Kac noticed that these developments bore more than a passing 
resemblance to developments in probability theory. He suggested that perhaps this distri- 
bution is normal and even conjectured certain number theory estimates which would imply 
that. Soon after describing this in a lecture, at which Paul Erdos was in the audience, 
Erdos and Kac were able to announce the result [7] : For any r e M, the proportion of the 
integers n < a; for which a;(n) < log log n + Ty/log log n tends to the limit 



as a; ^ oo. In other words the quantity in (4) is distributed like a normal distribution with 
mean and variance 1. 

Erdos and Kac's original proof was based on the central limit theorem, and Brun's sieve. 
A different proof follows from the work of Selberg [30] (extending and simplifying the work 
of [29]) who obtained an asymptotic formula for nk{x) uniformly in a wide range of k. Yet 
a third proof is provided by Halberstam [11] who showed how to compute the moments 

(6) J^(a;(n) -logloga;)'^, 

n<x 
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for natural numbers k, and showed that these agreed with the moments of a normal 
distribution. Since the normal distribution is well-known to be determined by its moments, 
he deduced the Erdos-Kac theorem. 

In this article, we give a simple method to compute the moments (6), and in fact we can 
obtain an asymptotic formula uniformly in a wide range of k. Then we discuss how such 
moments can be formulated for more general sequences assuming sieve type hypotheses. 

Theorem 1. For any natural number k we let Ck = T{k + l)/{2''^^T{k/2 + l)). Uniformly 
for even natural numbers k < (log log x) 3 we have 

k^ 



V(a;(n) - log log x)'^ = Cfcx(logloga;)'=/2(l + o(-=^=)), 
^ \ \ V los log xJ f 

n<x V u u 

and uniformly for odd natural numbers k < (log log x) ^ we have 
^{Lu{n) - log log a;)'' < Cfea;(logloga;)'=/^ 



/2 k2 



We will deduce this Theorem from the following technical proposition. 
Proposition 2. Define 



1 - ^ ifp\n 



Let z > 10^ be a real number. Uniformly for even natural numbers k < (log log 2)3 we 
have 

n<x p<z 

while, uniformly for odd natural numbers k < (log log z)^ , we have 

(7b) E (E/p("))' « gfe^(loglogz)^/^ Jl + 2fe7r(z)fe. 

Deduction of Theorem 1. We seek to evaluate 5^^<^(a;(n) — logloga;)'^ for natural numbers 
k < (log log x) 3. Set z = x^ and note that, for n < x, 

a;(n) -loglogx = E/p(^)+ E 1 + (E "^/-^ ~ ^°S^°S^) ^ E ■^p('^) + ^^C^)' 

p<z p\n,p>z p<z p<z 
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Thus for some positive constant c we obtain that 

{u;{n) - log log x)^ = ( 5] /,(n)) ' + O ( {ck)'-' (^) | J] /,(n) |') . 

p<z e=o ^ ^ p<z 

When we sum this up over all integers n < x the first term above is handled through 

e 

(7a, b). To handle the remainder terms we need to estimate J2n<x ^p<z fpi^) 

i < k — 1. When i is even this is once again available through (7a). Suppose £ is odd. By 
Cauchy-Schwarz we get that 

E|E«»)|'s(E(E«"))"y(E(E«"))'")*. 

n<x p<z n<x p<z n<x p<z 

and using (7a) we deduce that this is 

< v/C^_iC£+ia;(loglog2;)^/^. 

Proof of Proposition 2. If r = YliPf* ^^e prime factorization of r we put fr{n) = 
Hi Then we may write 

n<x p<z Pi,--- ,Pk^z n<x 

To proceed further, let us consider more generally ^^<2, fri^). 

Suppose r = 11^=1 ^f' where the qi are distinct primes and > 1. Set R = Y[i=i 1i 
and observe that ii d — (n, R) then frip) = fr{d). Therefore, with r denoting the divisor 
function, 

E/^(-) = E^(^) E ^ = J:fr(d){-d^^ + Oir{R/d))) 

n<x d\R n<x d\R ' 

{n,R)=d 

= ^Y.fr{dMR/d) + 0{T{R)). 

d\R 

Thus seting 

d\R g«||r 

we conclude that 

Y,fr{n)^G{r)x + 0{T{R)). 

n<x 
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Observe that G{r) = unless r is square-full and so 



n<x p<z pi,... ,pk<z 

Pi" 'Pk square- full 

Suppose qi < q2 < . . . < Qs are the distinct primes in pi ■ ■ -pf.. Note that since p\- ■ - pk is 
square-full we have s < k/2. Thus our main term above is 

s<k/2 qi<q2---<qs<z ai,... ,as>2 
Ej a{=fc 

When /c is even there is a term s = k/2 (and all cuj = 2) which gives rise to the Gaussian 
moments. This term contributes 



E n|(i-^)- 



2fc/2(/,/2)! ^ 11 g. 

gi,... ,qfe/2<2 »=1 
distinct 

By ignoring the distinctness condition, we see that the sum over q^s is bounded above by 
(^p<zi^ — On the other hand, if we consider gi, qk/2-1 as given then 

the sum over 5^/2 is plainly at least J2tt^,^2<p<z^^ ~ ^/p)/P where we let tt^ denote the 
n-th smallest prime. Repeating this argument, the sum over the g's is bounded below by 
iYl7Vk/2<p<z^^ ~ l/p)/]?)^/^. Therefore the term with s — k/2 contributes 

j^l / ■.k/2 

(9) ^^^^^(loglog^ + 0(l + loglog/c)j . 

To estimate the terms s < k/2 we use that < G{q'^^ ■ ■ -qf") < ■ ■ - Qs) and so 

these terms contribute 

s<k/2 q<z ai,...,as>2 

The number of ways of writing k = ai + . . . + ag with each ct^ > 2 equals the number of 
ways of writing k — s — a'i + . . . + a'g where each a[>l and is therefore i^^^) ■ Thus these 
remainder terms contribute 

(10) 2 E + 

s<k/2 ' ^ ^ 

Proposition 2 follows upon combining (8), (9), and (10). 
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The main novelty in our proof above is the introduction of the function fr{n) whose 
expectation over integers n below x is small unless r is square-full. This leads easily to 
a recognition of the main term in the asymptotics of the moments. Previous approaches 
expanded out {uj{n) — log log x)'^ using the binomial theorem, and then there are several 
main terms which must be carefully cancelled out before the desired asymptotic emerges. 
Our use of this simpler technique was inspired by [25]. Recently Rizwanur Khan [17] 
builds on this idea to prove that the spacings between normal numbers obey a Poisson 
distribution law. 

This technique extends readily to the study of u{n) in many other sequences. We 
formulate this in a sieve like setting: 

Let A — {tti, . . . , ttx} be a (multi)-set of x (not necessarily distinct) natural numbers. 
Let Ad = #{?^ < X : d\an}- We suppose that there is a real valued, non-negative 
multiplicative function h{d) such that for square-free d we may write 

h{d) 

Ad = -^X + Td. 

d 

It is natural to suppose that ^ <h{d) < d for all square-free d, and we do so below. Here 
Td denotes a remainder term which we expect to be small: either small for all d, or maybe 
just small on average over d. 

Let V be any set of primes. In sieve theory one attempts to estimate #{n < x : 
(a„, m) = 1} for m = ripeP^*' terms of the function h and the error terms rd- Here we 
want to understand the distribution of values of uj'p{a), as we vary through elements a of 
A, where U'p{a) is defined to be the number of primes p E V which divide a. We expect 
that the distribution of u>-p{a) is normal with "mean" and "variance" given by 

^^v■.= y:^ and a^^:=T^(l-^ 

and wish to find conditions under which this is true. There is a simple heuristic which 
explains why this should usually be true: Suppose that for each prime p we have a sequence 
of independent random variables bi^p, . . . , bx,p each of which is 1 with probability h{p)/p 
and otherwise; and we let bj be the product of the primes p for which bj^p — 1. The 
bj form a probabilistic model for the aj satisfying our sieve hypotheses, the key point 
being that, in the model, whether or not bj is divisible by different primes is independent. 
One can use the central limit theorem to show that, as a; — > oo, the distribution of uj-p{b) 
becomes normal with mean ji-p and variance cr|,. 

2 

Proposition 3. Uniformly for all natural numbers k < a^, we have 

J2{^v{a)-iivy ^Ckxa!^(l + o(^^'^^+o(^lif^ J] \rd\), 

aeA deDkiV) 
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if k is even, and 



^ (uj-p{a) - nr) < Ckxa^ — + fj,^ ^ \rd\, 

if k is odd. Here {V) denotes the set of squarefree integers which are the product of at 
most k primes all from the set V . 

Proof. The proof is similar to that of Proposition 2, and so we record only the main 
points. We define fp{a) = 1 — h{p)/p if p\a and —h{p)/p \ip\ a. If r = Hi Pf' is the prime 
factorization of r we put /r(a) = Jli/pil^)"*- Note that uj-p{a) — /i-p = X^^g-p /p(ci), and 
so 

(11) ('^^(^) ~^^) = J2^p^-Pk(^^- 

a€A pi,... ,Pk€V a€A 

As in Proposition 2, consider more generally ^^eA f^i'^)- Suppose r = 111=1 ^f* where 
the qi are distinct primes and each ctj > 1. Set R = 111=1 Qi observe that if d = (a, R) 
then fr{a) = fr{d)- Note that 



E 1 = E E /^(e) = E /^(^)-^'^ 

aeA a£A e\{R/d) e\R/d 

{a,R)=d de\n 



= X 



d V p 

p\{R/d) ^ e\{R/d) 



Therefore 



E^(«) = E^(^) E 1 

aeA d\R aeA 

{a,R)=d 



d\R p\{R/d) d\R e\(R/d) 

(12a) = G{r)x + r mE(r, m), 



m\R 



where 



(12b) GM=n(^o-^r-(^)"(i-^))^ 



Q 

q"\\r 



and 



</"||r, q\m " q"\\r, q\(R/m) ^ 



8 



ANDREW GRANVILLE AND K. SOUNDARARAJAN 



We input the above analysis in (11). Consider first the main terms that arise. Notice 
that G{r) = unless r is square-full, and so the main terms look exactly like the cor- 
responding main terms in Proposition 2. We record the only small difference from the 
analysis there. When k is even there is a leading contribution from the terms with s = k/2 
and all cui = 2 (in notation analogous to Proposition 2); this term contributes 

k\ ^ -Q /^(Qz) /-^ h{qi) \ 



qi distinct 

The sum over q''s is bounded above by Up, and is bounded below by 

where we let Tin{V) denote the n-th smallest prime in V and made use of the fact that 
< {h(j))/p){l — h{p)/p) < 1/4. The remainder of the argument is exactly the same as in 
Proposition 2. 

Finally we need to deal with the "error" term contribution to (11). To estimate the 
error terms that arise in (11), we use that \E{pi- ■ ■pk,m)\ < Y[pi\m^(P'i)/Pi- Thus the 
error term is 



k 



< 



E E w E 

£=1 m=qi...qe>l Pi,--- ^Pk^V Pi\m * 

qi<q2<---<qe€'P Tn\pi---pk 



Fix m and let ej — #{i ■ Pi — Qj} for each j, I < j < i- Then there are eo := 
k — (ei + • ■ ■ + ee) < k — £ primes pi which are not equal to any qj , and so their contribution 
to the final sum is < /j,^ . Therefore the final sum is 

<- E C).^" E 

0<eo<k-e ^ ei + ---+ee=k-eo 

each ei>l 



0<eo<fc-l 



since k^ < < /i-p. This completes the proof of the Proposition. 

One way of using Proposition 3 is to take V to be the set of primes below z where z is 
suitably small so that the error term arising from the jr^l's is negligible. If the numbers a 
in A are not too large, then there cannot be too many primes larger than z that divide a, 
and so Proposition 3 furnishes information about a; (a). Note that we used precisely such 
an argument in deducing Theorem 1 from Proposition 2. 
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In this manner, Proposition 3 may be used to prove the Erdos-Kac theorem for many- 
interesting sequences of integers. For example, Halberstam [12] showed such a result for 
the shifted primes p — 1, which the reader can now deduce from Proposition 3 and the 
Bombieri-Vinogradov theorem. 

Similarly, one can take A — {f{n) : n < x} for f{t) e Z[t]. In this case h{p) is bounded 
by the degree of / except at finitely many primes, and the prime ideal theorem implies 
that //-p, CTp = m log log a; + 0(1) where m is the number of distinct irreducible factors of 
/. Again this example was first considered by Halberstam [12]. 

AUadi [1] proved an Erdos-Kac theorem for integers without large prime factors. Propo- 
sition 3 reduces this problem to obtaining information about multiples of d in this set of 
"smooth numbers." We invite the reader to fill in this information. 

In place of a; (a) we may study more generally the distribution of values of g{a) where 
g is an "additive function." Recall that an additive function satisfies ^(1) = 0, and 
g{mn) = g{m) + g{n) whenever m and n are coprime. Its values are determined by the 
prime-power values g{p^). If in addition g{p^) — g{p) for all /c > 1 we say that the function 
g is "strongly additive." The strongly additive functions form a particularly nice subclass 
of additive functions and for convenience we restrict ourselves to this subclass. 

Proposition 4. Let A be a (multi)-set of x integers, and let h{d) and be as above. Let 
V be a set of primes, and let g be a real-valued, strongly additive function with \gip)\ < M 
for all p & V. Let 



per ^ per ^ ^ 

2 

Then, uniformly for all even natural numbers k < {a-p{g)/M)3 , 

aeA p\a per ^ deOkir) 

per 

2 

while for all odd natural numbers k < {a-p{g) /M)^ ^ 

E(E^w-M«))'«c.w^+M'(E^)' E \rA- 

aeA p\a per ^ deDu{r) 

per 

Proof. We follow closely the proofs of Propositions 2 and 3, making appropriate modifica- 
tions. Let fr{p) be as in the proof of Proposition 3. Then we wish to evaluate 

aeA per pi,---,Pker aeA 
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We may now input the results (12a,b,c) here. Consider first the error terms that arise. 
Since \g{p)\ < M for all p E V this contribution is at most times the corresponding 
error in Proposition 3. To wit, the error terms are 



As for the main term, note that G{r) = unless r is square-full and so if qi < q2 <■■■< qa 
are the distinct primes among the pi, . . . , pk our main term is 

I.] « 

(13) xY. Y. T. -ri:^Ilo(^'rG(,r ■■■€■)■ 

s<k/2qi<---<qs ai,... ,as>2 * i=l 

When k is even there is a term with s = k/2 and all cuj = 2 which is the leading 
contribution to (13). This term contributes 

qi,--- ,qk/2ev 1=1 

Qi distinct 

If we fix Qi, . . . , gfe/2-1, then the sum over 5^/2 is (J-p^gY + 0{M'^k), since \g{p) \ < M for 
all p eV, and < h{p) < p. Therefore the contribution of the term s = k/2 to (13) is 

CkX [avigf + OiM^k)) = Ckxaj, {gf(l + ^ (^^) ) , 
since kM < a-p{g). 

Now consider the terms s < k/2 in (13). Since \G{q^^ ■■■'?"'')| < 11^=1 ~ 
h{qi)/qi), and 111=1 \9iQi)\"' < M^~'^^ 111=1 Ifl'(Q'i)P) we see that these terms contribute an 
amount whose magnitude is 

^^E^;--(Ei.(«)i^^(-^))' E ^ 

E oii=k 



2- E sICT) 



8<k/2 

using that equals the number of ways of writing A; = ^ojj with each cuj > 2. The 

Proposition follows. 

One way to apply Proposition 4 is to take V to be the set of all primes below z with 
\g{p)\ being small. If there are not too many values of p with \g{p)\ being large, then 
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we would expect that g{a) is roughly the same as g-pi.c') for most a. In such situations, 
Proposition 4 which furnishes the distribution of gv{o) would also furnish the distribution 
of g{a). In this manner one can deduce the result of Kubilius and Shapiro [31] which is 
a powerful generalization of the Erdos-Kac theorem for additive functions. Indeed we can 
derive such a Kubilius-Shapiro result in the more general sieve theoretic framework given 
above, and for all additive functions rather than only for the subclass of strongly additive 
functions. 

There are many other interesting number theory questions in which an Erdos-Kac type 
theorem has been proved. We have collected some of these references below^ and invite 
the reader to determine which of these Erdos-Kac type theorems can be deduced from the 
results given herein. The reader may also be interested in the textbooks [4, 18, 32] for a 
more classical discussion of some of these issues, and to the elegant essays [2, 16]. 
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